Speech synthesis using non-uniform units in the Verbmobil project

نویسندگان

  • Simon King
  • Thomas Portele
  • Florian Höfer
چکیده

IN THE VERBMOBIL PROJECT Simon Kingy Thomas Portele Florian H ofer Institut f ur Kommunikationsforschung und Phonetik (IKP), Universit at Bonn Poppelsdorfer Allee 47, D-53115 Bonn, Germany http://www.ikp.uni-bonn.de ynow at the Centre for Speech Technology Research, University of Edinburgh, 80, South Bridge, Edinburgh EH1 1HN, GB http://www.cstr.ed.ac.uk email: [email protected] ABSTRACT We describe a concatenative speech synthesiser for British English which uses the HADIFIX [8] inventory structure originally developed for German by Portele. An inventory of non-uniform units was investigated with the aimof improving segmental quality compared to diphones. A combination of soft (diphone) and hard concatenation was used, which allowed a dramatic reduction in inventory size. We also present a unit selection algorithm which selects an optimum sequence of units from this inventory for a given phoneme sequence. The work described is part of the concept-to-speech synthesiser for the language and speech project Verbmobil [12] which is funded by the German Ministry of Science (BMBF).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Synthesis by word concatenation

Verbmobil is a speaker-independent system that offers translation assistance in dialogue situations. In co-operation with other institutes we are developing the speech synthesis module within Verbmobil for German and American English. Current priority is given to an enhancement of naturalness of our PSOLA based concatenative synthesis of German. Due to a tight schedule we investigated alternati...

متن کامل

Multilingual Generation for Translation in Speech-to-Speech Dialogues and its Realization in Verbmobil

This paper presents the generation module of the speech-to-speech dialogue translation system Verbmobil. Spontaneous speech, large multilingual vocabulary, difficulty of the translation task, robustness and real-time constraints make the design of such a module very challenging. In order to overcome these difficulties, we have developed a system based on a general kernel and the declarativity o...

متن کامل

Hierarchical non-uniform unit selection based on prosodic structure

In speech synthesis systems based on wave concatenation, using longer units can generate more natural synthetic speech. In order to improve the usage of longer units in the corpus, this paper proposed a hierarchical non-uniform unit selection framework. Each layer included in the framework is an independent searching procedure which searches for different sized units and adopts suitable natural...

متن کامل

Within-Word vs. Across-Word Decoding for Online Speech Recognition

In this paper we describe methods for improving the RWTH German speech recognizer used within the VERBMOBIL project. In particular, we present acceleration methods for the search based on both within-word and across-word phoneme models. The recognizer in the VERBMOBIL project is used in an online environment. We will discuss some incremental methods to reduce the response time of an on-line spe...

متن کامل

Adaptive manipulation of non-uniform synthesis units using multi-level unit transcription

A synthesis-by-rule system based on the selective use of non-uniform synthesis units has been developed. This system uses a natural speech database and an algorithm which searches the database for the optimal speech segment to be used as the synthesis unit. Because of flexible use of synthesis units, this scheme has great advantages, especially in expressing many coarticulat~ry variations. Howe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997